PAPEL: A Dictionary-Based Lexical Ontology for Portuguese
نویسندگان
چکیده
This paper describes a project aimed at creating a lexical ontology extracted (semi) automatically from a large Portuguese general dictionary. Although using machine readable dictionaries to extract semantic information is not new, we believe this is the first attempt for the Portuguese language. The paper describes a (to be) freely available resource, dubbed PAPEL, explaining the process used and the tools developed, and illustrating it with one specific relation: Causation.
منابع مشابه
PAPEL: a lexical ontology for Portuguese
PAPEL (Palavras Associadas Porto Editora Linguateca) is a lexical resource for natural language processing (NLP) of Portuguese which is being built by Linguateca, based on processing a major commercial Portuguese dictionary, the Dicionário da Ĺıngua Portuguesa (DLP) [1] developed and owned by the largest Portuguese dictionary publisher, Porto Editora. There are known similar lexical resources f...
متن کاملExtracting Lexical-Semantic Knowledge from the Portuguese Wiktionary
Public domain collaborative resources like Wiktionary and Wikipedia have recently become attractive sources for information extraction. To use these resources in natural languague processing (NLP) tasks, efficient programmatic access to their contents is required. In this work, we have extracted semantic relations automatically from the Portuguese Wiktionary and compared our results with the re...
متن کاملRelations extracted from a Portuguese dictionary: results and first evaluation
This paper presents PAPEL, a lexical resource for Portuguese, consisting of relations between terms, extracted by (semi) automatic means from a general language dictionary. An overview on the construction process is given, the included relations are presented and a quantitative vision is provided together with some examples. Synonymy relations were evaluated using a thesaurus as a golden standa...
متن کاملEnriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary
In this article we present an exploratory approach to enrich a WordNet-like lexical ontology with the synonyms present in a standard monolingual Portuguese dictionary. The dictionary was converted from PDF into XML and senses were automatically identified and annotated. This allowed us to extract them, independently of definitions, and to create sets of synonyms (synsets). These synsets were th...
متن کاملBeyond the automatic construction of a lexical ontology for Portuguese: resources developed in the scope of Onto.PT
Besides the lexical ontology itself, during the Onto.PT project other resources were developed. Those included handcrafted grammars for extracting semantic relations, a term-based lexicalsemantic network extracted from dictionaries, a thesaurus with fuzzy memberships, polarities assigned to the Onto.PT synsets, as well as resources used for evaluation, such as manual mappings between words and ...
متن کامل